Maxwell–Boltzmann statistics

In statistical mechanics, Maxwell–Boltzmann statistics describes the statistical distribution of material particles over various energy states in thermal equilibrium, when the temperature is high enough and density is low enough to render quantum effects negligible.

The expected number of particles with energy \varepsilon_i for Maxwell–Boltzmann statistics is N_i where:


N_i = \frac {g_i} {e^{(\varepsilon_i-\mu)/kT}} = \frac{N}{Z}\,g_i e^{-\varepsilon_i/kT}

where:

N=\sum_i N_i\,
Z=\sum_i g_i e^{-\varepsilon_i/kT}

Equivalently, the distribution is sometimes expressed as


N_i = \frac {1} {e^{(\varepsilon_i-\mu)/kT}} = \frac{N}{Z}\,e^{-\varepsilon_i/kT}

where the index i  now specifies a particular state rather than the set of all states with energy \varepsilon_i, and Z=\sum_i e^{-\varepsilon_i/kT}

Fermi–Dirac and Bose–Einstein statistics apply when quantum effects are important and the particles are "indistinguishable". Quantum effects appear if the concentration of particles (N/V) ≥ nq. Here nq is the quantum concentration, for which the interparticle distance is equal to the thermal de Broglie wavelength, so that the wavefunctions of the particles are touching but not overlapping. Fermi–Dirac statistics apply to fermions (particles that obey the Pauli exclusion principle), and Bose–Einstein statistics apply to bosons. As the quantum concentration depends on temperature; most systems at high temperatures obey the classical (Maxwell–Boltzmann) limit unless they have a very high density, as for a white dwarf. Both Fermi–Dirac and Bose–Einstein become Maxwell–Boltzmann statistics at high temperature or at low concentration.

Maxwell–Boltzmann statistics are often described as the statistics of "distinguishable" classical particles. In other words the configuration of particle A in state 1 and particle B in state 2 is different from the case where particle B is in state 1 and particle A is in state 2. This assumption leads to the proper (Boltzmann) distribution of particles in the energy states, but yields non-physical results for the entropy, as embodied in the Gibbs paradox. This problem disappears when it is realized that all particles are in fact indistinguishable. Both of these distributions approach the Maxwell–Boltzmann distribution in the limit of high temperature and low density, without the need for any ad hoc assumptions. Maxwell–Boltzmann statistics are particularly useful for studying gases. Fermi–Dirac statistics are most often used for the study of electrons in solids. As such, they form the basis of semiconductor device theory and electronics.

Contents

A derivation of the Maxwell–Boltzmann distribution

Suppose we have a container with a huge number of very small identical particles. Although the particles are identical, we still identify them by drawing numbers on them in the way lottery balls are being labelled with numbers and even colors.

All of those tiny particles are moving inside that container in all directions with great speed. Because the particles are speeding around, they do possess some energy. The Maxwell–Boltzmann distribution is a mathematical function that speaks about how many particles in the container have a certain energy.

It can be so that many particles have the same amount of energy \varepsilon_i. The number of particles with the same energy \varepsilon_i is N_i. The number of particles possessing another energy \varepsilon_j is N_j. In physical speech this statement is lavishly inflated into something complicated which states that those many particles N_i with the same energy amount \varepsilon_i, all occupy a so called "energy level" i . The concept of energy level is used to graphically/mathematically describe and analyse the properties of particles and events experienced by them. Physicists take into consideration the ways particles arrange themself and thus there is more than one way of occupying an energy level and that's the reason why the particles were tagged like lottery ball, to know the intentions of each one of them.

To begin with, let's ignore the degeneracy problem: assume that there is only one single way to put N_i particles into energy level i . What follows next is a bit of combinatorial thinking which has little to do in accurately describing the reservoir of particles.

The number of different ways of performing an ordered selection of one single object from N objects is obviously N. The number of different ways of selecting two objects from N objects, in a particular order, is thus N(N − 1) and that of selecting n objects in a particular order is seen to be N!/(N − n)!. The number of ways of selecting 2 objects from N objects without regard to order is N(N − 1) divided by the number of ways 2 objects can be ordered, which is 2!. It can be seen that the number of ways of selecting n objects from N objects without regard to order is the binomial coefficient: N!/(n!(N − n)!). If we now have a set of boxes labelled a, b, c, d, e, ..., k, then the number of ways of selecting Na objects from a total of N objects and placing them in box a, then selecting Nb objects from the remaining N − Na objects and placing them in box b, then selecting Nc objects from the remaining N − Na − Nb objects and placing them in box c, and continuing until no object is left outside is


\begin{align}
W & = \frac{N!}{N_a!(N-N_a)!} \times \frac{(N-N_a)!}{N_b!(N-N_a-N_b)!} ~ \times \frac{(N-N_a-N_b)!}{N_c!(N-N_a-N_b-N_c)!} \times \ldots \times \frac{(N-\ldots-N_l)!}{N_k!(N-\ldots-N_l-N_k)!} = \\ \\
& = \frac{N!}{N_a!N_b!N_c!\ldots N_k!(N-\ldots-N_l-N_k)!}
\end{align}

and because not even a single object is to be left outside the boxes, implies that the sum made of the terms Na, Nb, Nc, Nd, Ne, ..., Nk must equal N, thus the term (N - Na - Nb - Nc - ... - Nl - Nk)! in the relation above evaluates to 0! which makes possible to write down that relation as


\begin{align}
W & = N!\prod_{i=a,b,c,...}^k \frac{1}{N_i!}
\end{align}

Now going back to the degeneracy problem which characterize the reservoir of particles. If the i-th box has a "degeneracy" of g_i, that is, it has g_i "sub-boxes", such that any way of filling the i-th box where the number in the sub-boxes is changed is a distinct way of filling the box, then the number of ways of filling the i-th box must be increased by the number of ways of distributing the N_i objects in the g_i "sub-boxes". The number of ways of placing N_i distinguishable objects in g_i "sub-boxes" is g_i^{N_i}. Thus the number of ways W that a total of N particles can be classified into energy levels according to their energies, while each level i having g_i distinct states such that the i-th level accommodates N_i particles is:

W=N!\prod \frac{g_i^{N_i}}{N_i!}

This is the form for W first derived by Boltzmann. Boltzmann's fundamental equation S=k\,\ln W relates the thermodynamic entropy S to the number of microstates W, where k is the Boltzmann constant. It was pointed out by Gibbs however, that the above expression for W does not yield an extensive entropy, and is therefore faulty. This problem is known as the Gibbs paradox The problem is that the particles considered by the above equation are not indistinguishable. In other words, for two particles (A and B) in two energy sublevels the population represented by [A,B] is considered distinct from the population [B,A] while for indistinguishable particles, they are not. If we carry out the argument for indistinguishable particles, we are led to the Bose-Einstein expression for W:

W=\prod_i \frac{(N_i%2Bg_i-1)!}{N_i!(g_i-1)!}

Both the Maxwell-Boltzmann distribution and the Bose-Einstein distribution are only valid for temperatures well above absolute zero, implying that g_i\gg 1. The Maxwell-Boltzmann distribution also requires low density, implying that g_i\gg N_i. Under these conditions, we may use Stirling's approximation for the factorial:


N! \approx N^N e^{-N},

to write:

W\approx\prod_i \frac{(N_i%2Bg_i)^{N_i%2Bg_i}}{N_i^{N_i}g_i^{g_i}}\approx\prod_i \frac{g_i^{N_i}(1%2BN_i/g_i)^{g_i}}{N_i^{N_i}}

Using the fact that (1%2BN_i/g_i)^{g_i}\approx e^{N_i} for g_i\gg N_i we can again use Stirlings approximation to write:

W\approx\prod_i \frac{g_i^{N_i}}{N_i!}

This is essentially a division by N! of Boltzmann's original expression for W, and this correction is referred to as correct Boltzmann counting.

We wish to find the N_i for which the function W is maximized, while considering the constraint that there is a fixed number of particles \left(N=\textstyle\sum N_i\right) and a fixed energy \left(E=\textstyle\sum N_i \varepsilon_i\right) in the container. The maxima of W and \ln(W) are achieved by the same values of N_i and, since it is easier to accomplish mathematically, we will maximize the latter function instead. We constrain our solution using Lagrange multipliers forming the function:


f(N_1,N_2,\ldots,N_n)=\ln(W)%2B\alpha(N-\sum N_i)%2B\beta(E-\sum N_i \varepsilon_i)

\ln W=\ln\left[\prod\limits_{i=1}^{n}\frac{g_i^{N_i}}{N_i!}\right] \approx \sum\limits_{i=1}^n\left(N_i\ln g_i-N_i\ln N_i %2B N_i\right)

Finally


f(N_1,N_2,\ldots,N_n)=\alpha N %2B\beta E %2B
\sum\limits_{i=1}^n\left(N_i\ln g_i-N_i\ln N_i %2B N_i-(\alpha%2B\beta\varepsilon_i) N_i\right)

In order to maximize the expression above we apply Fermat's theorem (stationary points), according to which local extrema, if exist, must be at critical points (partial derivatives vanish):


\frac{\partial f}{\partial N_i}=\ln g_i-\ln N_i -(\alpha%2B\beta\varepsilon_i) = 0

By solving the equations above (i=1\ldots n) we arrive to an expression for N_i:


N_i = \frac{g_i}{e^{\alpha%2B\beta \varepsilon_i}}

Substituting this expression for N_i into the equation for \ln W and assuming that N\gg 1 yields:

\ln W = \alpha N%2B\beta E\,

or, differentiating and rearranging:

dE=\frac{1}{\beta}d\ln W-\frac{\alpha}{\beta}dN

Boltzmann realized that this is just an expression of the second law of thermodynamics. Identifying dE as the internal energy, the second law of thermodynamics states that for variation only in entropy (S) and particle number (N):

dE=T\,dS%2B\mu\,dN

where T is the temperature and μ is the chemical potential. Boltzmann's famous equation S=k\,\ln W is the realization that the entropy is proportional to \ln W with the constant of proportionality being Boltzmann's constant. It follows immediately that \beta=1/kT and \alpha=-\mu/kT so that the populations may now be written:


N_i = \frac{g_i}{e^{(\varepsilon_i-\mu)/kT}}

Note that the above formula is sometimes written:


N_i = \frac{g_i}{e^{\varepsilon_i/kT}/z}

where z=\exp(\mu/kT) is the absolute activity.

Alternatively, we may use the fact that

\sum_i N_i=N\,

to obtain the population numbers as


N_i = N\frac{g_i e^{-\varepsilon_i/kT}}{Z}

where Z is the partition function defined by:


Z = \sum_i g_i e^{-\varepsilon_i/kT}

Another derivation (not as fundamental)

In the above discussion, the Boltzmann distribution function was obtained via directly analysing the multiplicities of a system. Alternatively, one can make use of the canonical ensemble. In a canonical ensemble, a system is in thermal contact with a reservoir. While energy is free to flow between the system and the reservoir, the reservoir is thought to have infinitely large heat capacity as to maintain constant temperature, T, for the combined system.

In the present context, our system is assumed to have the energy levels \varepsilon _i with degeneracies g_i. As before, we would like to calculate the probability that our system has energy \varepsilon_i.

If our system is in state \; s_1, then there would be a corresponding number of microstates available to the reservoir. Call this number \; \Omega _ R (s_1). By assumption, the combined system (of the system we are interested in and the reservoir) is isolated, so all microstates are equally probable. Therefore, for instance, if  \; \Omega _ R (s_1) = 2 \; \Omega _ R (s_2) , we can conclude that our system is twice as likely to be in state \; s_1 than \; s_2. In general, if \; P(s_i) is the probability that our system is in state \; s_i,

\frac{P(s_1)}{P(s_2)} = \frac{\Omega _ R (s_1)}{\Omega _ R (s_2)}.

Since the entropy of the reservoir \; S_R = k \ln \Omega _R, the above becomes

\frac{P(s_1)}{P(s_2)} = \frac{ e^{S_R(s_1)/k} }{ e^{S_R(s_2)/k} } = e^{(S_R (s_1) - S_R (s_2))/k}.

Next we recall the thermodynamic identity (from the first law of thermodynamics):

d S_R = \frac{1}{T} (d U_R %2B P \, d V_R - \mu \, d N_R).

In a canonical ensemble, there is no exchange of particles, so the d N_R term is zero. Similarly, d V_R = 0. This gives

 S_R (s_1) - S_R (s_2) = \frac{1}{T} (U_R (s_1) - U_R (s_2)) = - \frac{1}{T} (E(s_1) - E(s_2)),

where \; U_R (s_i) and \; E(s_i) denote the energies of the reservoir and the system at s_i, respectively. For the second equality we have used the conservation of energy. Substituting into the first equation relating P(s_1), \; P(s_2):


\frac{P(s_1)}{P(s_2)} =  \frac{ e^{ - E(s_1) / kT } }{ e^{ - E(s_2) / kT} },

which implies, for any state s of the system


P(s) = \frac{1}{Z} e^{- E(s) / kT},

where Z is an appropriately chosen "constant" to make total probability 1. (Z is constant provided that the temperature T is invariant.) It is obvious that

\; Z = \sum _s e^{- E(s) / kT},

where the index s runs through all microstates of the system. Z is sometimes called the Boltzmann sum over states (or "Zustandsumme" in the original German). If we index the summation via the energy eigenvalues instead of all possible states, degeneracy must be taken into account. The probability of our system having energy \varepsilon _i is simply the sum of the probabilities of all corresponding microstates:

P (\varepsilon _i) = \frac{1}{Z} g_i e^{- \varepsilon_i / kT}

where, with obvious modification,

Z = \sum _j g_j  e^{- \varepsilon _j / kT},

this is the same result as before.

Comments

Limits of applicability

The Bose–Einstein and Fermi–Dirac distributions may be written:


N_i = \frac{g_i}{e^{(\varepsilon_i-\mu)/kT}\mp 1}.

Assuming the minimum value of \varepsilon_i is small, it can be seen that the condition under which the Maxwell–Boltzmann distribution is valid is when

e^{-\mu/kT} \gg 1. \,

For an ideal gas, we can calculate the chemical potential using the development in the Sackur–Tetrode article to show that:

\mu=\left(\frac{\partial E}{\partial N}\right)_{S,V}=-kT\ln\left(\frac{V}{N\Lambda^3}\right)

where E is the total internal energy, S is the entropy, V is the volume, and \Lambda is the thermal de Broglie wavelength. The condition for the applicability of the Maxwell–Boltzmann distribution for an ideal gas is again shown to be

\frac{V}{N\Lambda^3}\gg 1.

See also

References

Bibliography